Evaluating semantic models with word-sentence relatedness

نویسندگان

  • Kimberly Glasgow
  • Matthew Roos
  • Amy J. Haufler
  • Mark A. Chevillet
  • Michael Wolmetz
چکیده

Semantic textual similarity (STS) systems are designed to encode and evaluate the semantic similarity between words, phrases, sentences, and documents. One method for assessing the quality or authenticity of semantic information encoded in these systems is by comparison with human judgments. A data set for evaluating semantic models was developed consisting of 775 English word-sentence pairs, each annotated for semantic relatedness by human raters engaged in a Maximum Difference Scaling (MDS) task, as well as a faster alternative task. As a sample application of this relatedness data, behavior-based relatedness was compared to the relatedness computed via four off-the-shelf STS models: n-gram, Latent Semantic Analysis (LSA), Word2Vec, and UMBC Ebiquity. Some STS models captured much of the variance in the human judgments collected, but they were not sensitive to the implicatures and entailments that were processed and considered by the participants. All text stimuli and judgment data have been made freely available.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Text Relatedness Based on a Word Thesaurus

The computation of relatedness between two fragments of text in an automated manner requires taking into account a wide range of factors pertaining to the meaning the two fragments convey, and the pairwise relations between their words. Without doubt, a measure of relatedness between text segments must take into account both the lexical and the semantic relatedness between words. Such a measure...

متن کامل

CDSMs for Semantic Relatedness and Entailment

Distributional Semantics Models (DSMs) have become widely accepted as successful models for lexical semantics. However their extension to handling larger structural units such as entire sentences remains challenging. Compositional DSMs (CDSMs) aim to successfully model sentence semantics by taking into account grammatical structure and logical words, which are ignored by simpler models. We expl...

متن کامل

Sentence Context Prevails Over Word Association in Aphasia Patients with Spared Comprehension: Evidence from N400 Event-Related Potential

Behavioral and event-related potential (ERP) studies on aphasia patients showed that lexical information is not lost but rather its integration into the working context is hampered. Studies have been conducted on the processing of sentence-level information (meaningful versus meaningless) and of word-level information (related versus unrelated) in aphasia patients, but we are not aware of any s...

متن کامل

Evaluating the Success of the Visual Learners in Vocabulary Learning through Word List versus Sentence Making Approaches

Thisstudy sought to evaluate the learners' achievements with the visual learning style when exposed to the sentence making and word list approaches. On that account, 45 basic level participants who studied at the Iran Language Institute (ILI), Bushehr, took part in this research study. At the outset, the learners were given Barsch learning style inventory (1991) to determine the learners' learn...

متن کامل

Evaluating the Success of the Visual Learners in Vocabulary Learning through Word List versus Sentence Making Approaches.

Thisstudy sought to evaluate the learners'''' achievements with the visual learning style when exposed to the sentence making and word list approaches. On that account, 45 basic level participants who studied at the Iran Language Institute (ILI), Bushehr, took part in this research study. At the outset, the learners were given Barsch learning style inventory (1991) to determine the learners''''...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1603.07253  شماره 

صفحات  -

تاریخ انتشار 2016